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ABSTRACT 

We combine Hubble Space Telescope ( HST) G102 and G141 near-IR (NIR) grism spectroscopy with //5T/WFC3- 
UVIS, //5T/WFC3-IR, and Spitzer/IRAC [3.6 /<rn] photometry to assemble a sample of massive (log (M star /M 0 ) ~ 

11.0) and quenched (specific star formation rate < 0.01 Gyr -1 ) galaxies at z ~ 1.5. Our sample of 41 galaxies 
is the largest with G102+G141 NIR spectroscopy for quenched sources at these redshifts. In contrast to the 
local universe, z ~ 1.5 quenched galaxies in the high-mass range have a wide range of stellar population 
properties. We find that their spectral energy distributions (SEDs) are well fitted with exponentially decreasing 
star formation histories and short star formation timescales (r ^ lOOMyr). Quenched galaxies also show a 
wide distribution in ages, between 1 and 4 Gyr. In the (u — r)o-versus-mass space quenched galaxies have a 
large spread in rest-frame color at a given mass. Most quenched galaxies populate the z ~ 1.5 red sequence 
(RS), but an important fraction of them (32%) have substantially bluer colors. Although with a large spread, 
we find that the quenched galaxies on the RS have older median ages (3.1 Gyr) than the quenched galaxies off the 
RS (1.5 Gyr). We also show that a rejuvenated SED cannot reproduce the observed stacked spectra of (the bluer) 
quenched galaxies off the RS. We derive the upper limit on the fraction of massive galaxies on the RS at - ~ 1.5 to 
be <43%. We speculate that the young quenched galaxies off the RS are in a transition phase between vigorous star 
formation at z > 2 and the z ~ 1 .5 RS. According to their estimated ages, the time required for quenched galaxies 
off the RS to join their counterparts on the z ~ 1.5 RS is of the order of ~1 Gyr. 

Key words: galaxies: formation - galaxies: high-redshift - galaxies: stellar content - Galaxy: evolution - infrared: 
galaxies - surveys 

Online-only material: color figures, extended figure 
1. INTRODUCTION 

The formation and evolution of massive galaxies are some of 
the most studied and debated topics in extragalactic astronomy 
today. In the local universe the most massive galaxies primarily 
populate a well-defined relation in the color-magnitude space 
known as the “red sequence” (e.g., Kauffmann et al. 2003; 

Baldry et al. 2004). These galaxies host mostly old, passively 
evolving stellar populations (e.g., Trageret al. 1998; Kuntschner 
et al. 2001; Mehlert et al. 2003; Sanchez-Blazquez et al. 2006; 

Renzini 2006 and references therein). The major processes 
behind mass assembly and structure formation of massive 
galaxies, however, are controversial. 

In particular, the role of major mergers (as advocated in, e.g., 

Mihos & Hernquist 1994; Bekki 1998; Hopkins et al. 2006) in 
the mass buildup and in the quenching of the star formation 
has recently been challenged by new evidence pointing toward 
a mass-induced truncation of the star formation (e.g., Peng 
et al. 2010, 2012). Evidence supporting the merger picture is 

* Based on observations with the NASA/ESA Hubble Space Telescope, 
obtained at the Space Telescope Science Institute, which is operated by 
AURA, Inc., under NASA contract NAS 5-26555. 


mostly restricted to z < 1 from the tightness of the galaxy 
color-magnitude relation (e.g., Stanford et al. 1998; Faber 
et al. 2007), the evolution of the mass function of red-sequence 
galaxies (e.g.. Bell et al. 2004; Faber et al. 2007; but see Cimatti 
et al. 2008; Scarlata et al. 2007), and the small scatter in the 
mass-to-light ratios of these systems (e.g.. Kelson et al. 2000). 
The super-solar a-element abundances found in local massive 
galaxies (e.g., Nelan et al. 2005; Thomas et al. 2005) can be 
reproduced by both merger- and mass-induced quenching as 
long as the process took place several Gyr ago and on short 
timescales (<1 Gyr; Thomas et al. 1999; Renzini 2009). 

Recently, minor mergers have also been considered as a viable 
mechanism for the size growth of massive galaxies, gaining 
more popularity after the discovery of the strong size evolution 
of early-type galaxies (e.g., Bundy et al. 2009; de Ravel et al. 
2009; Lopez-Sanjuan et al. 2010; Ryan et al. 2012). The 
discovery of the compact nature of massive quenched galaxies at 
z ~ 2 implies they must become -~2-4 times larger with time to 
match the sizes of quenched galaxies in the local universe (e.g., 
Daddi et al. 2005; Trujillo et al. 2006; van Dokkum et al. 2008; 
McLure et al. 2013). Other authors, however, have suggested 
that the evolution of the mass-size relation is primarily driven 
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by the appearance of new large galaxies at later time rather 
than by the growth of individual objects (e.g., Valentinuzzi et al. 
2010; Poggianti et al. 2012; Carollo et al. 2013; Cassata et al. 
2013). 

The epoch between 1 < z < 3 is crucial for the assembly 
of massive galaxies, as most of their size and number density 
evolution seem to have taken place during this time (e.g., 
Fontana et al. 2006; Kriek et al. 2008; Marchesini et al. 2009; 
Ilbert et al. 2010; Cassata et al. 2011; Conselice et al. 2011). 
Critical to study this redshift range are data covering the rest- 
frame optical, and in fact, the past few years have seen the 
completion of deep ground- and space-based photometric IR 
surveys. Although large samples of both star-forming (SF) and 
quenched massive galaxies at 1 < z < 3 can be assembled 
using color-selection techniques (e.g., Daddi et al. 2004; Lin 
et al. 2012; Newman et al. 2012 and references therein), results 
based on them are still hampered by the lack of spectroscopic 
redshifts, which translates into strong systematic effects on 
stellar population parameters (including, but not limited to, 
stellar masses and luminosity-weighted ages). Moreover, the 
mutual contamination between color-selected SF and quenched 
samples can only be established statistically as, typically, no 
further information is available for individual sources (e.g., 
~30% contamination for BzK selection; Cameron et al. 2011). 
Detailed spectroscopic studies on individual sources are still 
extremely rare and limited to the brightest, non-representative 
sources (Kriek et al. 2009; van Dokkum et al. 2010; Ferreras 
et al. 2012). 

Additional complications appear when quiescent galaxies at 
Z > 1 are considered. Without emission lines, the redshift 
determination must be constrained with continuum-emission 
features, like the 4000 A and Balmer breaks, redshifted to the 
near-IR (N1R) regime. This requires deep IR spectroscopic 
observations, which (until recently) were limited to single- 
object spectroscopy. These studies have shown that the bulk 
of the star formation in massive quenched galaxies took place 
between 2 < z < 4 with formation timescales below 1 Gyr 
(Longhetti et al. 2005; Kriek et al. 2006; Newman et al. 2010; 
Onodera et al. 2012). Ground-based NIR spectroscopy, however, 
is limited to the J, H, and K atmospheric windows and suffers 
from high background emission and time-variable absorption 
features. Spectroscopic observations from space, then, represent 
the best way to acquire large samples of quiescent galaxies at 
Z > 1 with homogeneous spectroscopic data sets. 

Here we present a stellar population study of a sample of 
quenched galaxies at 1.0 ^ z < 2.0 discovered in the Wide 
Field Camera 3 (WFC3) Infrared Spectroscopic Parallel survey 
(WISP; Atek et al. 2010). The Hubble Space Telescope (//ST) 
NIR grism spectroscopy covers the wavelength range 0.9 pm 
^ k ^ 1.6 p m and allows us to study quenched high-z galaxies 
on an individual basis. 

The paper is structured as follows. In Section 2, we present 
the observations and analysis of photometry and spectra. In 
Section 3, we describe the color and magnitude selection of 
our sample. In Section 4, we characterize our galaxy sample, 
including the calculation of spectrophotometric redshifts, stel- 
lar population parameters, and estimates of their systematic and 
random uncertainties. In Section 5, we discuss our main results, 
and we summarize them in Section 6. Five Appendices describ- 
ing details in the data analysis are included at the end of the 
paper. 

In this paper, we assume a flat cosmology with Hq = 
70kms _1 Mpc -1 , = 0.3, and il\ = 0.7. Photometric 


magnitudes are always expressed in the AB system (Oke & 
Gunn 1983). 

2. OBSERVATIONS AND DATA 

The sample presented in this work was extracted from the 
first 27 WISP fields observed with the WFC3 on board the 
HST. Specifically, these fields were observed with both WFC3 
grisms and with both WFC3/IR and WFC3/UVIS cameras. 
The WFC3/IR provides a plate scale of O'.' 13 pixel 1 , over a 
total field of view (FOV) of 123" x 136". The UVIS channel of 
the WFC3 has a plate scale of O'.' 04 pixel 1 , over a total FOV 
of 162" x 162". For seven of our fields the UVIS photometry 
was obtained from 2x2 binned data (see Table 1). All the 
observations were carried out in pure parallel mode. 12 

In each field, imaging was obtained using the F475X, F600LP, 
FI 10W, and F160W filters, with typical exposure times of 400, 
400, 1000, and 500 s, respectively (see Table 1 for details). The 
two deeper fields Par96 and Pari 36 were observed in the UVIS 
bands F606W and F814W instead. The WFC3/UVIS CCDs 
suffer a degradation of their charge transfer efficiency with 
time, affecting F475X (F606W) and F600LP (F814W) fluxes. 
A flux correction was implemented and it is fully described in 
Appendix A. 

Dispersed images were obtained using the WFC3 /IR camera 
and the G102 and G141 NIR grisms. The blue (G102) grism 
provides a resolving power R = 210 for a point source and 
covers the 0.8-1.15 jim wavelength range. The red (G141) 
grism provides a resolving power R = 130 for a point source 
and covers the 1.07-1.7 /im range. The wavelength overlap 
between the two grisms ensures an accurate flux calibration of 
the spectra. Details of reduction and calibration of the WFC3 
data are presented in Atek et al. (2010). 

For 22 of the 27 fields we also obtained Spitzer IRAC obser- 
vations at 3.6 p m (see Table 1 for details). The same pipeline 
used to produce the “Spitzer Enhanced Imaging Products” was 
also run on our data to measure the 3.6 pm fluxes (P. L. Capak 
et al., in preparation). 13 

2.1. Photometric Catalog 

In the analysis of the galaxy sample, we simultaneously make 
use of the broadband fluxes and NIR spectra. Thus, special 
attention should be paid to making sure that the aperture used for 
the extraction of the spectra and the aperture used to compute the 
broadband fluxes are the same. In order not to introduce artificial 
correlations among adjacent pixels, we worked on images with 
the original pixel scales. As a consequence, typical softwares 
used to compute object fluxes in matched apertures in different 
images could not be used. 

In order to create the multi-band catalog of aperture-matched 
fluxes, we proceeded as follows. We first used the SExtractor 
software (Bertin & Arnouts 1996) to detect objects in the 
deepest lowland images and to compute the parameters 
of the Kron elliptical apertures (semi-axes and position angle). 
The Kron apertures, as computed by SExtractor, are intended 
to give the most precise estimate of total magnitudes for 
galaxies. Details about the routine can be found in the SExtractor 


12 In the pure-parallel mode, one or more instruments sample the HST focal 
plane, while the prime program observes its planned target according to its 
desired visit schedule. 

13 For a full explanation of the pipeline, see also 
http://irsa.ipac.caltech.edu/data/SPITZER/Enhanced/Imaging/ under 
"Explanatory Supplement.” 
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Table 1 

Summary of Observations for the WISP Survey Fields 


Field 

R.A. 

(HMS) 

Deck 

(DMS) 

FI 10W 

(s) 

G102 

(s) 

F160W 

(s) 

G141 

(s) 

F475X 

(s) 

F600LP 

(s) 

IRAC 3.6 /xm 
(s) 

Pai'95 

01 10 04.38 

-02 24 54.9 

909 

5715 

384 

2209 

400 

400 

2100 

Par91 

01 10 05.79 

-02 25 03.3 

909 

5715 

384 

2209 

400 

400 

2100 

Par97 

01 10 06.30 

-02 23 44.7 

859 

5515 

406 

2109 

400 

400 

2100 

Par84 

01 10 07.60 

-02 25 11.6 

1162 

7521 

559 

2809 

400 

400 

2100 

Par79 

01 10 08.96 

-02 25 16.2 

1187 

7521 

534 

2809 

400 

400 

2100 

Par81 

01 10 09.12 

-02 22 17.1 

1187 

7521 

534 

2809 

400 

400 

2100 

Par80 

01 10 09.27 

-02 22 17.7 

1187 

7521 

534 

2809 

400 

400 

2100 

Par89 

01 10 09.96 

-02 22 20.0 

1187 

7512 

534 

2809 

400 

400 

2100 

Par83 

01 10 10.71 

-02 24 09.7 

859 

5515 

406 

2109 

400 

400 

2100 

Par96 

02 09 24.40 

-04 43 41.6 

4295 

28081 

1765 

11430 

3000 a 

3000 a 

2100 

Par74 

09 10 48.14 

+10 17 20.3 

1065 

5918 

431 

2306 

400 

400 

2100 

Par87 

09 46 46.39 

+47 14 58.2 

912 

4915 

406 

1906 

400 

400 

1500 

Pari 14 b 

10 40 58.09 

+06 07 31.0 

1137 

7221 

456 

2909 

600 

600 


Parl31 b 

10 48 22.94 

+13 03 50.5 

2171 

13039 

884 

5215 

600 

600 


Pari 15 b 

11 18 55.08 

+02 17 09.6 

912 

5215 

381 

2106 

600 

600 


Parl35 b 

11 22 24.01 

+57 50 58.9 

862 

4712 

406 

1906 

600 

600 

1500 

Parl36 b 

12 26 28.84 

+05 23 02.9 

3036 

18857 

1137 

7318 

2000 a 

2000 a 


Par76 

13 27 22.17 

+44 30 39.3 

887 

5515 

406 

2006 

400 

400 

1500 

Parl20 b 

13 56 51.50 

+17 02 33.9 

837 

4512 

381 

1806 

600 

600 

1500 

Par73 

14 05 12.86 

+46 59 19.9 

1034 

6118 

456 

2509 

400 

400 

1500 

Par64 

14 37 29.04 

-01 49 49.5 

1112 

5918 

456 

2306 

400 

400 

2100 

Par66 

14 37 29.22 

-01 49 54.5 

1237 

7421 

559 

2909 

400 

400 

2100 

Par67 

15 24 07.75 

+09 54 53.9 

959 

5715 

406 

2209 

400 

400 

1500 

Par69 

15 24 09.75 

+09 54 50.0 

1087 

5721 

431 

2309 

400 

400 

1500 

Par94 

22 05 26.66 

-00 17 48.5 

1624 

9024 

534 

3309 

400 

400 

2100 

Par-68 

23 33 33.04 

+39 21 20.5 

1215 

7721 

534 

3009 

400 

400 

1500 

Parl47 b 

23 58 19.72 

-10 14 56.36 

962 

5418 

406 

2106 

600 

600 



Notes. 

a UVIS observations of this field were acquired using the F606W and F814W filters. 
b UVIS observations binned 2x2. 


manual. 14 For each galaxy, we then computed the total flux in 
all bands, using the Jpnow-band-defined aperture and a custom 
IDL code. 

The elliptical apertures were then appropriately scaled to 
account for different pixel scales between the UVIS and IR 
detectors. The local background for each elliptical aperture was 
calculated within a square annulus using the same prescription 
as in SExtractor. Within each annulus, the background per pixel 
was computed as a 3cr clipped mean. Then, the total background- 
subtracted flux was computed within the aperture by adding all 
the aperture pixels. 

Flux uncertainties were computed using Monte Carlo simula- 
tions. We created 1000 images for each galaxy by randomizing 
the flux in each pixel of the elliptical aperture according to a nor- 
mal distribution with a width provided by the la error map. We 
measured the total elliptical flux as explained above on each of 
the simulated images. We then computed the la errors by fitting 
a Gaussian to the distribution of the 1000 measured fluxes. We 
only retain in the catalog sources with Tpiiowflux larger than 3a 
( J \- 1 ] o w is our deepest NIR band; see Table 1). For undetected 
sources in the other bands, we show the 3a flux limit in the 
figures. 

Because of the significantly lower spatial resolution of the 
Spitzer images, IRAC 3.6 pm fluxes were computed inde- 
pendently from the HST fluxes, using the “Spitzer Enhanced 

14 https://www.astromatic.net/pubsvn/software/sextractor/trunk/ 
doc/sextractor.pdf 


Imaging Products” pipeline, with some modifications (details in 
P. L. Capak et al., in preparation). Briefly, the pipeline utilizes 
the Spitzer MOPEX software package to create a mosaic for 
each IRAC image, before identifying and extracting sources us- 
ing SExtractor. Fixed aperture fluxes were computed for each 
object, using an aperture of 2" 8 diameter. This is a smaller aper- 
ture compared to the typical 3" 8 and 5" 8 diameter apertures used 
in the “Spitzer Enhanced Imaging Products” catalog. However, 
our exposures are significantly deeper than the standard images 
analyzed with the pipeline, so a smaller aperture is more ap- 
propriate to reduce contamination from nearby neighbors. The 
aperture fluxes were corrected to total fluxes assuming that the 
objects are point sources. This is a fair assumption as the FWHM 
of the Spitzer point-spread function (PSF) of 1"5 is larger than 
the FWHM in the NIR of the largest resolved sources used in 
this paper (1"43). 

The Spitzer and WFC3 catalogs were cross-matched using 
target coordinates, by searching within a radius of 1"5. We 
flagged all sources for which multiple objects fell within the 
PSF of the IRAC data. This procedure results in a five-broadband 
photometric catalog for our 27 fields of 15,302 sources brighter 
than 27.5 mag in /pnowband. Of those, 6991 have IRAC 3.6 pm 
detections, 26% of which are blended in this band. 

2.2. NIR Spectroscopy: 2D Cleaning and ID Extractions 

Due to the slitless nature of the spectroscopic observations, 
spectra from different sources may overlap. If not properly 
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F160W [mag] 

Figure 1. Comparison between the magnitudes computed from the extracted 
G141 spectra and those computed from the //fi 60 W images shows an overall 
good agreement between them. The comparison was based on 102 galaxies 
for which the spectral //fi 60 W magnitudes were calculated after contamination 
correction. 

(A color version of this figure is available in the online journal.) 


removed, the flux contribution from nearby sources may sub- 
stantially change the total flux and shape of the extracted spec- 
tra, preventing an accurate fit of the spectral energy distribution 
(SED; e.g., Gobat et al. 2013). In order to perform an optimal 
cleaning of the 2D data (reduced and calibrated with aXe soft- 
ware; Kiimmel et al. 2009; see Atek et al. 2010 for details), 
the ID spectra were extracted using a custom written IDL code. 
The details of our extractions and a comparison with the aXe 
ID extractions are presented in Appendix B. Shortly, while aXe 
assumes a single Gaussian function to describe the spatial light 
profile of each source, we use a double Gaussian function. Our 
approach results in more reliable flux levels compared to extrac- 
tions performed by the aXe code. 

In order to test the quality of our spectral cleaning and 
extraction, we compare the fluxes from the spectra with our NIR 
photometric data. In Figure 1 , we show a comparison between 
the magnitudes computed from the extracted G141 spectra and 
the magnitudes computed from the //fi6ow images for 102 
galaxies. The magnitudes from the spectra were computed 
by convolving them with the throughput of the //hoow filter, 
which is fully covered by the G141 spectral range. The median 
magnitude difference between photometry and grism data is 
—0.01 mag with a lcr scatter of ±0.18 mag. 
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2 3 

Redshift 


Figure 2. /fhow — //F160W color vs. redshift diagram based on Bruzual & 
Chariot (2003) models shows that with our color selection criterion 
Gfhow — f^Fl60W ^ 0.6, dashed black line), we mostly recover simple stellar 
population models (quenched, red triangles) between 1.2 < z < 3.0. Dust-free 
continuous star formation models (blue diamonds) are rejected at all redshifts, 
while some dusty (Ay = 1) continuous star formation models (green crosses) 
at 2.0 < z < 3-0 pass our color cut. For all models, solar metallicity and a 
redshift of formation of 4 have been assumed. 


(A color version of this figure is available in the online journal.) 


3. COLOR-MAGNITUDE SELECTION OF THE SAMPLE 

The 4000 A break fully falls within the wavelength range 
covered by the grism spectra for objects in the 1.2 < z < 2.7 
redshift range. This break can be used not only as a redshift 
indicator but also to study stellar population properties in 
galaxies with sufficient signal-to-noise (S/N) spectra (e.g., 
Ferreras et al. 2009; Hathi et al. 2009; Onodera et al. 2012). 
We limit the analysis of the spectra only to objects with 
/fhow— 7 /fi 60 W A 0.6 and magnitude brighter than //pi6ow = 
23. As shown in Figure 2, the color cut pre-selects sources with 
spectral breaks broadly covered by our spectra for which the 
redshifts can be measured, while the magnitude cut ensures we 
have spectra with sufficient S/N. 

Among all sources with //pi6ow A 23, we removed the stars 
using a size-magnitude diagram as explained in Appendix C. 
After visual inspection of all the sources flagged as stars (315), 
we found that 10 were misclassified galaxies and these were 
re-included in the sample. 

In the left panel of Figure 3, we show the /fhow— Hpi6ow 
versus Rf600lp— /fhow color-color diagram for 1352 galaxies 
brighter than ll \ i eow = 23. The right panel is the corresponding 
plot for the two deep WISP fields, showing /fhow— F/fi60w 
versus /f814W~ /fhow instead (163 galaxies). Sources brighter 
than //pi6ow = 23 constitute ~1 1% of our photometric catalog. 
Out of 283 galaxies with /fhow— 7/fi60w ^ 0.6, 84 (5) are 
upper limits in the T?f600lp (/fsi4w) band, and we show the 3 a 
limits in the color-color plots. Different model tracks showing 
the color evolution with redshift are overplotted (see figure 
caption and legend). Note that at the highest redshift considered 
( z = 2.4), the truncated 200 Myr burst model (orange line) is 
already passive, but significantly bluer in Rf600lp— /fhow than 
the instantaneous burst at the same redshift (red line). Figures 2 
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25 WISPs fields (1,352 gol) 
> 0.6 (238 gal) 


z=2.4 



Instantaneous burst, z F =4 
200 Myr burst, Z F =3 
Constant SFH, z F =4, A/=1 
Constant SFH, z F =4, \=0 


I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 


Par96 & Par136 fields (163 gal) 
J-H > 0.6 (47 gal) 


4 


•• 




-I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I l_ 


0 


1 

Rf 600 LP 


2 

— J Fnow [ab] 


3 


0 


1 

If814W 


2 

— Jmow [AB] 
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Figure 3. Observed-frame color-color plots, showing galaxies from our photometric catalog with /fhow ^ 23, illustrate how our color selection criterion 
Dfiiow — ffFl 60 W 4? 0.6, dashed black line) selects short-starburst galaxies (quenched, red and orange model tracks) and dusty continuous star-forming sources 
(purple model track). Left: we show data from 25 WISP fields using Rf 600 LP _ ^Fltow- Right: we show data from the two deepest WISP fields using 7 f 814W — Allow 
color instead. Arrows indicate 3<r lower limits in Kf 600 LP _ Allow color, while dots indicate proper detections in all bands. The color tracks represent different star 
formation histories from Bruzual & Chariot (2003) models ranging from 1.0 ^ z ^ 2.4 in steps of 0.1: light blue, continuous star formation; purple, continuous star 
formation assuming extinction Ay = 1; red, single stellar population model (quenched); and orange, starburst of 200 Myr duration. For the starburst model, a redshift 
of formation of 3 has been assumed. For all the other models, a formation redshift of 4 was used. All models assume solar metallicity. The reddening vector changes 
as a function of redshift and is therefore not shown. 


(A color version of this figure is available in the online journal.) 


and 3 show that dusty SF galaxies are also selected by our 
Tfhow — F/pi60W color criterion. However, these galaxies can 
be identified later through the SED fitting of their combined 
photometry + spectra (see Section 5). 

We visually inspected both the spectra and the direct images of 
all the objects satisfying the ./fiiow~#fi 60 W color cut to remove 
galaxies for which the spectrum showed instrumental issues 
(e.g., for sources at the edge of the chip for which the spectrum 
is truncated), for which the spectrum was contaminated at a 
level that even our procedure could not recover (e.g., close to 
a bright star), and for which the WFC3 /Spitzer photometry 
showed blended sources. We excluded 149 galaxies in this step. 
Note that about half of them were excluded because the full 
wavelength range of their spectra was not totally sampled within 
the CCD detector (i.e., truncation). We also excluded 32 galaxies 
for which the 2D dispersed stamps did not show any signal. We 
compared these 32 galaxies with the parent sample and found 
that they are at the faint end of the considered magnitude range 
(with average //pigowband of 22.7) and tend to have slightly 
larger radii than the remaining sources of the same magnitude. 
This may introduce a bias against less concentrated objects in 
our faintest magnitude bin. 

Our master sample includes 102 sources with /fhow - Hfi60w 
^ 0.6 and //pi6ow ^ 23. In 23 out of 102 galaxies, we detected 
emission lines. It is important to note that, because in slitless 
mode the spectral resolution depends on the size of the object 
in the dispersion direction, our capability to detect emission- 
line galaxies depends not only on the line flux and equivalent 
width but also on the size of the galaxy itself. A detailed line- 


completeness analysis for the full WISP survey is presented 
in Colbert et al. (2013). The emission-line recovery rate drops 
below 40% for objects larger than 0T6, The median size of our 
sample galaxies is 0 '.'45, corresponding to a completeness of 
about 70%. As discussed in Colbert et al., the completeness 
never reaches 100% because of spectral contamination issues. 
Our sample of 102 galaxies does not suffer from this problem, 
so we expect the completeness rate in our case to be even larger 
than 70%. 

4. ANALYSIS 

Prior to deriving spectrophotometric redshifts and stellar 
population properties (SPPs) from our analysis, we study the 
effects of emission lines in retrieving these continuum-based 
parameters. Our low spectral resolution might dilute emission 
lines in the continuum, affecting the retrieved redshifts and 
SPPs. The results of this study are presented in Appendix D. 
Briefly, we find that in those cases where emission lines are 
diluted in the continuum, the llux contribution from the line 
is below the ler uncertainties in the grism data. Therefore, in 
addition to masking the detected emission lines in the SEDs, 
there is no need to make specific modeling of emission lines for 
our SED fitting process and simulations. 

4.1. Spectrophotometric Redshift 

We compute spectrophotometric redshift for the 102 galaxies 
in our sample by fitting stellar population models (see below) 
to the combination of the photometric and spectroscopic data. 
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Spectro-Phot. Redshlft 

Figure 4. Spectrophotometric redshift distribution for the master sample of 102 
galaxies shows a median value of z = 1.3 (dashed line). 

We developed our own IDL code for this task because of 
the (1) different spectral resolution in the two grisms, (2) 
different spectral resolution in each individual source, and (3) 
simultaneous fit to photometry and spectra. 

We computed spectrophotometric redshifts by fitting 51 
empirical templates from Coleman et al. (1980) and Kinney 
et al. (1996) to the Bf 475 x(^f 606 w)> ^f 600 lpGfsi 4 w)> G102 and 
G141 spectra, and IRAC 3.6 /xm data, when available. Before 
the x 2 minimization, we matched the spectral resolution of the 
templates to that of the data (on an object-by-object basis). 
To do so, we re-binned both the data and the templates to the 
resolution element, computed from the FWHM of the object’s 
spatial profile in the dispersion direction. Given the typical 
size of our sources, the spectral resolution elements are on 
average ~85.8 A (162.8 A) in G102 (G141). We considered 
the 0.0 ^ z ^ 4.0 range with constant redshift steps of 0.05 
and applied intergalactic medium absorption following Madau 
(1995). For sources with emission lines the purely spectroscopic 
redshift based on those features was used instead (19% of our 
master sample). We determined redshifts for single-line galaxies 
assuming they are Ha emission. See Dominguez et al. (2012) 
and Colbert et al. (2013) for further details on emission-line 
redshift estimations. 

We estimated random errors on the redshifts by using Monte 
Carlo simulations. For each galaxy, we varied both the spectra 
and the photometric points within the la uncertainties, assum- 
ing Gaussian distribution. We created 500 realizations for each 
galaxy and derived the random error on the redshift as the 1 a 
width of the resulting distribution. 

The spectrophotometric redshift distribution for the full 
master sample of 102 galaxies is shown in Figure 4. The 
distribution shows that 77% of the selected galaxies are at z > 1 . 

4.2. Stellar Population Properties 

We compute luminosity-weighted age, stellar mass, star 
formation rate (SFR), and star formation history (SFH) of the 
master sample galaxies, using the same custom IDL code applied 
in the previous section, keeping the redshift fixed and using the 


Bruzual & Chariot (2003) library of stellar population synthesis 
models. We consider seven SFHs (continuous, exponentially 
declining with e-folding times r = 0.01,0.1, 1,5 Gyr, and 
exponentially increasing with r = 1,5 Gyr), 70 log-binned 
ages between 10 Myr and 12 Gyr, Salpeter initial mass function 
(IMF; Salpeter 1955), and solar metallicity. 

We use a Salpeter IMF based on recent results on local 
massive galaxies. A variety of observations, including stellar 
kinematics (e.g., Dutton et al. 2012; Cappellari et al. 2012), 
stellar populations (e.g., Conroy & van Dokkum 2012; Spiniello 
et al. 2012; Tortora et al. 2013), and gravitational lensing (e.g., 
Auger et al. 2010; Treu et al. 2010; Brewer et al. 2012), suggest 
that ellipticals and spiral bulges show “heavier” IMFs (Salpeter- 
like, with larger fractions of low-mass stars) than galaxy disks 
(Chabrier-like IMF; Chabrier 2003). Thus, the assumption of a 
Salpeter IMF seems to be more representative of the massive and 
quenched galaxy population we address in the present study. We 
note that if a Chabrier-like IMF was used, our inferred masses 
and SFRs would be smaller by a factor «1.7. Accordingly, our 
models and the mass-completeness limit we inferred from them 
(see Section 4.4) would be smaller in a similar amount without 
compromising our main results and conclusions. We also note 
that our specific SFRs (SSFRs) are robust to IMF changes. 
This is of particular importance as we base our quenched/SF 
classification on this parameter. Systematic changes of the IMF 
with galaxy mass are unlikely in the mass range considered in 
this work and are briefly discussed at the end of Appendix E. 

Solar metallicity is in agreement with recent spectroscopic 
results by Onoderaetal. (20 12) on quenched galaxies at z ~ 1.4. 
We considered a range of extinctions (0 ^ A v 1), and we 
used the Calzetti et al. (2000) extinction law. The age of the 
stellar population is constrained to be smaller than the age of the 
universe in the adopted cosmology. Analogously to the redshift 
random error determination, random uncertainties on the SPPs 
were derived using Monte Carlo simulations. 

We used the results of the stellar population modeling to select 
quenched galaxies as commonly done in the literature, based on 
the SSFR: quenched galaxies have SSFR < 0.01 Gyr 1 and 
constitute 71% of our master sample. 

4.3. Systematic Uncertainties in Redshift and 
Stellar Population Parameters 

The statistical uncertainties associated with the measure- 
ments of redshifts and stellar population parameters are as- 
sessed in Sections 4.1 and 4.2. Here, we further quantify the 
systematic uncertainties introduced by the degeneracies in stel- 
lar population models. It is well known that if the photometric/ 
spectroscopic data do not provide adequate constraints, study- 
ing SPPs of galaxies through SED fitting may produce strongly 
degenerate results by different combinations of age, SFR, SFH, 
and extinction (e.g., Muzzin et al. 2009 and references therein). 
For this reason, we performed a set of simulations to assess how 
well we can recover the SPPs with the available data. 

The simulations are described in detail in Appendix E. Briefly, 
we used Bruzual & Chariot (2003) models with known stellar 
population properties and redshifts to simulate our spectroscopic 
and photometric data. These “model data” were treated in the 
same way as the real data: we applied both the color and 
magnitude cuts as for our galaxies, and the selected model data 
were fitted with our customized IDL code to recover their SPPs. 
In this section, we only summarize the main results of these 
tests and their implications for our galaxy sample. 


6 


The Astrophysical Journal, 778:126 (24pp), 2013 December 1 


Bedregal et al. 




Figure 5. Left: the //Fi 60 W _ band magnitude histogram for our entire photometric catalog shows that it is highly complete above our //fi 60 W = 23.0 magnitude cut (red 
line). Right: in the stellar mass vs. redshift diagram for our sample galaxies (after removing cases with potentially strong systematics; see Section 4.3) we overplot 
Bruzual & Chariot (2003) model predictions for 100% mass completeness at different redshifts (red line) and our mass completeness cut at log(M sta r/Mo) = 10.65 
(black dashed line). Error bars in the bottom-right corner represent median total uncertainties (random and systematic). 

(A color version of this figure is available in the online journal.) 


Not surprisingly, the main conclusion of our study was that, in 
general, the SPPs were recovered more accurately for galaxies 
at z > 1.3, where the 4000 A break was fully covered by the 
grism spectra. More degeneracies appeared when the break was 
only coarsely covered by the UVIS photometric points. 

At redshift z. 1.3, our simulations show we can recover 
the age measurements to within ~35% of the age value for 
our complete age range. We can also distinguish between short 
(r 100 Myr) and long (r ^ 1 Gyr) SFHs, although we 
cannot separate SFHs with r = 100 Myr and r = 10 Myr. Most 
importantly, we find that our distinction between quenched and 
SF galaxies (defined by a rough limit at SSFR = 1 0 2 Gyr -1 ) 
is very robust. 

At lower redshifts, we found we cannot reliably recover spec- 
trophotometric redshifts below 1. Therefore, we decided to ex- 
clude from our galaxy sample all sources below this redshift. At 
1 < z < 1 .3 we recovered most of the SPP values as at z > 1 .3, 
although with larger scatter. At these lower redshifts, however, 
we found that the age and extinction show strong systematic off- 
sets with respect to the input parameter values. This was mainly 
produced by a strong anti-correlation between both galaxy prop- 
erties. However, because the distinction between quenched and 
SF galaxies is defined as a function of the SSFR only, even 
at z < 1.3 we can robustly distinguish between both galaxy 
types. The stellar mass is the most robust and best constrained 
parameter at all redshifts. 

Overall, the addition of IRAC 3.6 /rm data to the SED 
fits slightly reduces the scatter on the ages and stellar-mass- 
systematic uncertainties. The use of IRAC data provides the 
strongest constraints in the determination of SFR, where the 
scatter of our models is clearly reduced. The recovered redshifts 


also suffer a very mild improvement in their systematic uncer- 
tainties when IRAC data are used, while the other SPPs are 
mostly unchanged. 

From the discussion above, we decided to remove from the 
following analysis all galaxies with z < 1 • Also, we will not use 
in our analysis the age of galaxies at z < 1.3 and the extinction 
of all galaxies independently of their redshift. 

In Table 2 we present a summary of the results from the 
systematic error analysis for redshift and SPPs, including results 
with and without IRAC data in the SED fits. Formally, many 
of the systematic offsets found are consistent with zero within 
the uncertainties. However, some cases have larger uncertainties 
than others. Therefore, in those cases we present our results as 
zero-systematic-error ± their 68% percentile. 

4.4. Stellar Mass Completeness 

The estimate of the mass completeness is not straightforward 
in a magnitude- and color-selected sample like ours. Together, 
these criteria imply a segregation not only in mass but also in 
current star formation activity and SFH. 

First, we explore the //pi60W-band completeness of our 
photometric catalog. In the left panel of Figure 5, we show the 
//[-ifiow-band histogram for our entire photometry (Section 2.1). 
We proceed by running 1000 simulations of our //p^ow-band 
FOV and re-extracting with SExtractor our simulated galaxies. 
Each of the 1000 galaxies per simulation was modeled as 
a random combination of observational properties for which 
ranges were set based on the data. We allowed a range of //fi6ow- 
band magnitudes between 26 and 15. Then, light profiles were 
modeled with a 2D Sersic function with 0 ^ n ^ 4, centered 
at random positions, with ellipticities between 0 and 1, and 
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Table 2 

Systematic Errors in SPPs* 


Param. 

Syst. Error (with IRAC) 


Syst. Error (no IRAC) 


Redsh 

1.0 <z< 1.2 : 

0.0 ±0.10 




z > 1.2 

0.0 ± 0.05 



Age a 

1.3 ^ z < 1.8& Age ^ 1.5 Gyr: 

0.0% ± 75% 

1.3 ^ z < 1.8 & Age < 1.5 Gyr : 

0.0% ± 65% 


z > 1.3 & Age >1.5 Gyr : 

0.0% ± 35% 

z > 1.3 & Age > 1.5 Gyr : 

0.0% ± 45% 

^star b 

Z < 1.2 : 

«j20% 




1.2 z < 1.8 : 

10% 




Z^ 1.8: 

10% 

Z ^ 1.8 : 

15% 

SFR C 

SFR < 2.5 M 0 yr -1 : 

0 




2.5 < SFR ^ 7.5 M 0 yr' 1 : 

0.0 ± 1 M 0 yr' 1 

2.5 ^ SFR ^ 7.5 M 0 yr' 1 : 

0.0 ± 2 M 0 yr' 1 


7.5 ^ SFR ^ 15 Mq yr' 1 : 

0.0 ± 3 M 0 yr' 1 

7.5 < SFR < 1 5 M 0 yr' 1 : 

0.0 ± 5 Mq yr' 1 


15 < SFR ^ 20 Mq yr -1 : 

0.0 ± 8 M 0 yr' 1 

15 < SFR ^ 20 M 0 yr' 1 : 

0.0 ± 15 Mq yr' 1 


SFR > 20 Mq yr' 1 : 

25 ± 20 M 0 yr' 1 

SFR > 20 Mq yr' 1 : 

45 ± 40 M 0 yr' 1 


Notes. 

* In the “no IRAC” column, if an empty space is present, the results are the same as in the “IRAC” case. 
a Only ages for z ^ 1.3 galaxies are considered in this work. 

b At z < 1.2, the systematic errors in stellar mass vary monotonically from 30%, 20%, and 10% at log .V/, tar / M q = 11.8, 11.0, and 10.6, 
respectively. 

c At z ^ 1.3, the error is valid for SFR < 20 Mq yr' 1 only. 


random projected orientations. The effective radii were allowed 
to vary between (T2 and O'.' 6 (typical range for our galaxies). We 
add random noise to every single model run such as an average 
// F160w = 22 mag galaxy has an integrated S/N = 200, like 
in the data. After extracting galaxies from each simulation, we 
study the completeness as a function of // F i60W-band magnitude, 
in bins of 0.2 mag. We found we are 100% complete down to 
magnitudes of 24.2, reaching 50% completeness at 25 mag. 
We also study completeness as a function of other parameters. 
Based on the effective radius, we are 100% complete up to 
0'.'35, reaching 70% completeness at 0'.'43. In terms of the Sersic 
index, we are 70% complete down to n = 2. Finally, we are 75% 
complete for ellipticities up to 0.4. Based on these results, our 
magnitude cut at // F i 60 W = 23 for our galaxy sample (Section 3) 
is conservative enough to allow us to study a 100% complete 
sample in //pi 60 w band with also very high completeness in 
other observables. 

In the right panel of Figure 5, we show the galaxy stellar 
mass as a function of redshift for our galaxies (after cleaning 
the sample of cases with possible strong systematics in their 
SPPs; see previous section). Due to our //fi60w = 23 magnitude 
cut, the lower mass envelope changes with redshift and depends 
on the galaxy SFH and age. In order not to introduce any bias in 
the analysis, we conservatively estimate the mass completeness 
limit to be the mass of the model with the highest M/L ratio for 
^Fi 60 W = 23 at each redshift, as indicated by the solid red curve 
in the figure. 1- ' In what follows, we adopt a mass completeness 
limit of 4.5 x 10 10 M 0 (dashed black line in right panel of 
Figure 5), which is the minimum mass measurable at z ~ 1.5 
with our conservative assumptions. We note that ~70% of our 
galaxies were above this mass cut. Of those, 40% are at or below 
redshift 1.5. 

4.5. The Final Sample of Quenched Galaxies 

The final sample of quenched galaxies includes sources at 
1.0 ^ z < 2.0. At z > 2.0 our mass limit implies that we 
find only a few of the rarest most massive galaxies, while 

15 The red curve shows the stellar mass corresponding to 77fi60W = 23, for an 
exponentially declining SFH, with r = 10 Myr and formation redshift z = 5. 


at z < 1.0, our simulations show that we cannot reliably 
retrieve photometric redshifts with our data (see Section 4.3 
and Appendix E). 

After the redshift and SSFR cuts, our sample includes 41 
quenched sources with H ^ 23, J — H ^ 0.6, log (M star /M 0 ) ^ 
10.65 and redshifts in the range 1.0 ^ z < 2.0. Four of the 
quenched galaxies (10%) have detected emission lines. 

We do not remove these galaxies from the sample of quenched 
objects for the following reasons: first, if star formation is at 
the origin of the detected emission lines, the line fluxes imply 
SFRs of about 0.4 M 0 yr" 1 (using a Kennicutt 1998 conver- 
sion), which, given the masses of these objects, would still make 
them quenched according to our SSFR definition. Second, the 
emission lines might be produced by ionization due to an ac- 
tive galactic nucleus (AGN), rather than star formation. Various 
studies connect AGN activity with the quenching of star forma- 
tion (e.g., Di Matteo et al. 2005; Hopkins et al. 2006; Oser et al. 
2012), so we do not want to bias our result against AGN activity. 
Because we lack information on the origin of the gas ionization, 
we leave the four emission-line galaxies in the sample and iden- 
tify them in the analysis when needed. As will be shown in 
the following sections, the presence of quiescent galaxies with 
emission lines does not affect our main results and conclusions. 

In Figure 6 (available in the electronic format of the paper), we 
show the 7pnow-band postage stamp of each galaxy of our final 
sample . In the same figure we present the G 1 02 and G 1 4 1 spectra 
together with the WFC3/UVIS and IRAC 3.6 pm photometric 
points used in the SED fits. The best SED model fit and some 
SPPs are also included. This 41 galaxy sample will be used in 
the rest of the paper. 

5. RESULTS AND DISCUSSION 

In the following section, we discuss the properties of the final 
sample of 41 quenched galaxies. 

5.1. Distributions of Stellar Population Properties 
of the Quenched Galaxy Sample 

In the top panel of Figure 7, we show the distribution of 
redshifts for our quenched galaxy sample. Redshifts in our 
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Figure 6. Examples of SED fits for our sample galaxies (fits for our 41 galaxy sample available in the electronic format of the paper). Left: 6x6 arcsec 2 image stamps 
in //fi60W band. Right: the data and SED fits. In black, the G102 and G141 spectroscopic data. In blue, the photometric data. Circles represent //5T/WFC3/UVIS 
Z?F 475 x an d ^F600LP photometry (arrows correspond to 3 a upper limits). Triangles represent Spitzer / IRAC 3.6 /xm observations (when available). In red, the best SED 
model fit from our library of Bruzual & Chariot (2003) stellar population models. In the y axis, flux densities, Fx, are normalized to 10 -19 erg s _1 cm -1 A -1 . 

(An extended, color version of this figure is available in the online journal.) 
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log(Stellar Mass) [M sun ] 

Figure 7. For our sample of 41 quenched galaxies, we show the histograms of 
redshift and stellar mass. Red dashed lines represent the median value for the 
histogram. 

(A color version of this figure is available in the online journal.) 


sample spread between 1.0 and ~2.0, with a median of 1.5. The 
sharp drop of the distribution for z < 1 is a direct consequence of 
our selection against these galaxies. As described in Appendix E, 
the redshift determination at z < 1 is dominated by strong 
systematics, obviously affecting the derived SPPs. 

We present the stellar mass distribution in the bottom panel 
of Figure 7. The stellar masses of quenched galaxies spread 
from log(M star /M 0 ) = 10.65 (our mass cut) to 11.70 with a 
median 11.15. 

The SFHs of our quenched galaxies are consistent with short 
star formation bursts, with all the galaxies having exponentially 
declining SFH with r ^ 100 Myr. 

In Figure 8 we present the luminosity-weighted age distri- 
bution for galaxies at z 1.3 (where this parameter could be 
constrained; see Section 4.3 and Appendix E). At these red- 
shifts our sample consists of 32 quenched galaxies. We observe 
a wide distribution of ages between 1 and 4 Gyr with a median 
of 2 Gyr. We compare these results with data of quenched galax- 
ies at similar redshifts from literature. The mean ages (~3 Gyr) 
found by Longhetti et al. (2005), Onodera et al. (2012), and 



!og(Age) [yr] 


Figure 8. Histogram of luminosity-weighted age for 32 quenched galaxies at 
z ^ 1.3 (in red). Dashed line corresponds to the median of the distribution. 
We compare our median age for quenched galaxies with ages from early-type 
galaxies from the literature: in turquoise, Longhetti et al. (2005) mean age range 
(predictions from different models) of 10 early-type galaxies at z ~ 1.5; in 
purple, van Dokkum et al. (201 1) age range of 15 low-Ha -emission galaxies at 
1 < z < 1.5; in green, the mean age of 18 z ^ 1.4 passively evolving galaxies 
from Onodera et al. (2012). 

(A color version of this figure is available in the online journal.) 


van Dokkum et al. (2011, 2—4 Gyr) are systematically above 
our median age, even though our age histogram shows a peak 
at 3-4 Gyr consistent with these works. This may be due to 
different selections between our sample and those from the lit- 
erature. In particular, being selected as extremely red objects 
(EROs; R — K' > 5), the Longhetti et al. (2005) sample is 
probably biased toward the oldest galaxies at these redshifts. 
The van Dokkum et al. (2011) sample is based on WFC3 grism 
data as in our study (though they only have the G141 grism). 
However, their sample is mass selected, while we use mass and 
color to select our quenched galaxies. Moreover, the reported 
ages in van Dokkum et al. are for galaxies significantly more 
massive than our sample (M star > 10 n M G ). Finally, in order 
to maximize observing efficiency, the Onodera et al. (2012) 
sample was constrained to galaxies inhabiting regions of high 
galaxy overdensity, where the star formation and quenching 
take place more quickly. Therefore, the Onodera et al. sample 
would include mostly very old galaxies at the redshifts they 
were observed. Compared to BzK z > I galaxy selection 
(and alike), which separates SF and quenched galaxies (e.g., 
Daddi et al. 2004; Cameron et al. 2011), our combination of 
spectral fitting and ./fiiow— Hfi60w color cut allows for blue 
(younger) quenched galaxies to be included in the sample (see 
section below). 

In summary, our wide age distribution covers the different 
age regimes previously reported in the literature. Although 
our median age (2 Gyr) is below previous reports, we have 
shown that this does not imply an inconsistency between the 
different results, but most likely it reflects different selection 
biases intrinsic to each galaxy sample. 
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Figure 9. Left: in the rest-frame u — r color vs. stellar mass diagram, we illustrate the large fraction of quenched (red) galaxies that have blue colors at these high 
stellar masses. Galaxies below the continuous tilted line are considered as "blue.” This line represents a linear fit to the local SDSS “red sequence” passively evolved 
to z = 1.5 (dashed) plus an extra offset from the local red-sequence width (—0.42 mag total offset). Asterisk symbols represent galaxies with emission lines. In gray, 
a density map with ~ 1 1 .000 galaxies from SDSS-DR7 with z D 0.05. In the bottom right comer, we show the median uncertainty in stellar mass for our galaxy 
sample. Right: we illustrate the (« — r ) o color distributions for quenched galaxy samples. Overplotted is a color histogram for SDSS galaxies with masses larger than 
our completeness mass limit. Each distribution is normalized to the total number of galaxies. Rest-frame colors were calculated from the best-fit Bruzual & Chariot 
(2003) model SED. 

(A color version of this figure is available in the online journal.) 


5.2. Mass versus Rest-frame u — r Color 

In Figure 9 we show rest-frame u — r color versus stellar 
mass for our sample of 41 quenched galaxies (in red, asterisks 
for galaxies with emission lines) together with local data from 
SDSS-DR7 (gray region; Abazajian et al. 2009). Sloan Digital 
Sky Survey (SDSS) colors for our galaxies were computed 
using the best-fit SED model and filter curves obtained from the 
SDSS webpage. When galaxies within our stellar mass range 
(log (M stal -/M Q ) D 10.65) are considered, the vast majority of 
SDSS galaxies belong to the red sequence. The color distribution 
for these objects shows a narrow peak at (u—r) o ~ 2.7. This 
is visible in the right panel of Figure 9, where we show 
the (« — r)o distribution of the local galaxy sample with 
log (M star /M 0 ) D 10.65 (gray histogram), together with the 
color distribution for our quenched galaxy sample. Our massive 
galaxies show comparatively a much broader color distribution 
than massive galaxies in the local universe, with (u—r) o colors 
as blue as 1.7 (consistent with recent results; see, e.g., van 
Dokkum et al. 2011; Newman et al. 2012). We note that, because 
of the color preselection applied to our galaxy sample, we 
are excluding even bluer, higher SFR galaxies. Therefore, the 
number of blue massive galaxies we obtain can be considered 
as a lower limit, implying an even larger spread in (u — Do 
colors at these redshifts compared to the local massive galaxy 
sample. 


5.3. Young and Old Quenched Galaxies 

In Figure 9, we find that our quenched-galaxy color distri- 
bution peaks at (u — r) o = 2.5, approximately 0.2 mag bluer 
compared to the peak of the color distribution of local red- 
sequence galaxies. This color difference matches the passive 
evolution of a short burst of star formation expected between 
Z ~ 1.5 (the median redshift of our galaxy sample) and z ~ 0. 

About 32% of our sample, however, has colors substantially 
bluer (DO. 4 mag) than the local red sequence, overlapping 
with the colors of local blue-cloud galaxies at masses below 
M star ~ 3 x 10 9 M q . In Figure 9, we show the best-fit red 
sequence to the z = 0 SDSS data passively evolved to z ~ 1-5 
(0.2 mag offset; dashed line) and with an additional offset so 
galaxies fall within the scatter of the red sequence (0.22 mag; 
continuous line). The SDSS red sequence was computed by 
fitting a linear relation to the SDSS log(M sta r) and ( u — r)o, 
while the width was taken as the 2 a dispersion with respect 
to this fit. We use this line to split our sample into quenched 
galaxies on and off the z ~ 1.5 red sequence. Approximately 
1 /3 of the sample is off the red sequence. 

At the beginning of this section, we describe how our 
7 fhow— 77fi60w selection method produces a quenched galaxy 
sample with lower median age than previous works in the 
literature. We also describe how the selection effects of each of 
the cited works systematically bias their samples toward older 


11 


The Astrophysical Journal, 778:126 (24pp), 2013 December 1 


Bedregal et al. 




2000 3000 4000 . 5000 6000 7000 

Wavelength [A] 



Figure 10. Stacked rest-frame spectra of z ^ 1.3 quenched galaxies emphasize 
the differences between on-red-sequence sources (top, 19 sources) and off-red- 
sequence galaxies (bottom, 13 sources). Stacked spectra and UVIS data points 
for individual galaxies are shown in orange, while best-fit Bruzual & Chariot 
(2003) models to the spectra are shown in black. The models are exponentially 
declining SFHs with r = lOMyr, solar metallicity, Salpeter IMF, and ages of 
3.0 and 1.4 Gyr for on- and off-red- sequence galaxies, respectively. In the top of 
each panel, we show the number of galaxies used in the co-added spectrum as a 
function of wavelength. Flux densities are normalized to median flux between 
5500 and 6000 A. 

(A color version of this figure is available in the online journal.) 


ages. Driven by this result, we further investigate the ages of 
our two sub-samples: quenched galaxies on and off the red 
sequence. Limiting the sample to quenched galaxies at z 1.3 
(where the age-sensitive 4000 A/Balmer break is covered by the 
spectroscopy), we find a significant age difference between the 
stellar populations of galaxies on and off the red sequence. 
The red-sequence quenched galaxies have a median age of 
3.1 Gyr, while quenched galaxies off the red sequence have 
a median age of only 1.5 Gyr. 

We further investigate this age difference by stacking the 
spectra of the quenched galaxies on and off the red sequence in 
order to increase our S/N. Figure 10 shows the co-added spectra 
of quenched red-sequence (top; 19 sources, orange) and off-red- 
sequence galaxies (bottom; 13 galaxies, orange), together with 
their best- fit Bruzual & Chariot (2003) stellar population models 
(black curves). The stack data were fitted using our full model 
library, where only metallicity was fixed to solar. The best-fit 
models are exponentially declining SFHs with r = lOMyr, 
solar metallicity, Salpeter IMF, and ages of 3.0 and 1.4 Gyr for 
on- and off-red-sequence galaxies, respectively. As we see, these 


models provide a good representation of the stacked data. Also, 
they confirm our results from the two galaxy sub-samples based 
on individual-galaxy analysis. These results are consistent with 
recent findings of Whitaker et al. (2013) in co-added spectra, 
even though their ages for blue (0.9 Gyr) and red quenched 
galaxies (1.6 Gyr) are younger than ours. 

5.4. “ Rejuvenation ” of Old Quenched Galaxies 
through a Secondary Starburst? 

The Bruzual & Chariot (2003) models we considered in 
Section 5.3 assume an SFH with a single massive star formation 
event that passively declines with time. In this scenario, a 
possible interpretation of the distinction between the measured 
ages of galaxies on and off the red sequence arises from the 
progress of the sample toward the red sequence observed at 
different epochs of evolution for individual galaxies. 

Even though these models fit the co-added spectra well, 
these are not the only possible SFHs for these galaxies. Small 
secondary starbursts (SBs) after quenching of the main stellar 
component have been observed at low and high redshifts (e.g., 
Kaviraj et al. 2013 and references therein). In this section, we 
consider the possibility that our quenched galaxies lie off the 
red sequence due to a relatively recent minor burst of star 
formation that rejuvenates the galaxies’ SEDs, making them 
appear relatively younger due to the production of luminous, 
high-mass O/B/A/ stars. 

To test this scenario, we constructed a new set of model SFHs, 
which are defined by a primary star formation event (fixed at 
very high z) followed by a smaller, secondary SB. Both the main 
stellar component and the SB are modeled with exponentially 
declining SFRs, with solar metallicity, Salpeter IMF, and no 
extinction. The main component produces a total stellar mass 
of 10 11 M q with r = lOMyr. The age of this main component 
before the SB ranges between 0.5 and 4.25 Gyr in steps of 
0.25 Gyr (16 model variants). The secondary SB has intensities 
of 1%, 5%, 10%, and 50% the total galaxy mass (four model 
variants) with r = 100 Myr. We observe the resulting SEDs at 
the time of the SB peak and after 50, 100, 250, and 500 Myr 
of that event (five model variants). The combinations of these 
possible parameters make a library of 320 models to be fitted to 
the co-added data. 

In Figure 11, we present the best model fits for our 
co-added spectra of quenched galaxies on and off the red se- 
quence (orange). 

Just as a point of reference, in the top panel of Figure 11, 
we see (in blue) the best model fit for quenched galaxies on 
the red sequence that includes a secondary SB. This model has 
an age of 4.5 Gyr. The main stellar component is 4.0 Gyr old 
before the SB, with a burst mass of 1%. The coarse shape of 
the co-added spectra is fitted by this model, showing, however, 
a larger x 2 value than the single-burst best model (top panel of 
Figure 10). The predicted 4.5 Gyr is older than the 3 Gyr from the 
single-burst model. This is not surprising as the galaxy must 
passively evolve after the SB to recover the originally red SED. 

In the central panel of Figure 1 1 , we present the best model 
fit to the co-added spectra of quenched galaxies off the red 
sequence. This model has an age of 1 .5 Gyr. The main stellar- 
component age before the SB is 1 .0 Gyr, with a burst mass of 1 %. 
The predicted age of 1 .5 Gyr is consistent with the single-burst 
model prediction for these galaxies. We highlight that the best 
model for the quenched off-red-sequence galaxy data is not a 
combination of an old, passive stellar population with a younger 
SB. The fitting process demands both stellar populations to 
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Figure 11. Same stacked data as in Figure 10 (orange). Top panel: co-added 
spectra and individual-galaxy photometry of quenched galaxies on the red 
sequence together with the best model fit with two stellar components (main+SB, 
in blue). Middle panel: our two-stellar-component models cannot reproduce a 
“rejuvenation” SFH (old main component + young SB) consistent with the data. 
We show the co-added data of quenched galaxies off the red sequence together 
with the best model fit with two stellar components (both components young, 
in blue). Bottom panel: off the red-sequence galaxy data together with three 
models: in purple, best model fit among those observed at the SB peak; in 
green, best model among those with ages of observation ^3 Gyr; and in red, 
best model found for galaxies on the red sequence (top panel), with the proper 
scaling factor to better fit the off-red-sequence data. 

(A color version of this figure is available in the online journal.) 


be similarly young, resembling a single-component young 
burst, i.e., the “rejuvenation” scenario is not supported by this 
analysis. 

Given the above result, we proceed by being more selective on 
choosing models that resemble a “rejuvenation” SFH and test if 
they fit the data of off-red-sequence galaxies. In the bottom panel 
of Figure 1 1 we show three examples. In purple, we plot the best 
model fit to the co-added spectrum for all models observed at 
the peak of the SB. The main stellar component is, again, young 
(1.5 Gyr). More important, even the weakest SB we are testing 
(1%) cannot reproduce the data and clearly underpredicts the 
flux in the red extreme of the co-added spectrum. Also, the blue 
flux is overpredicted as can be seen by comparing the best-fit 
models to the individual-galaxy UVIS data points (not included 
in fit). Only one upper limit among 26 data points is consistent 
with this blue SED model. 

In addition, we also considered only those models with an 
old (age 3 Gyr) primary stellar component. The resulting 
best-fit model is shown in green in Figure 11. Similarly to 
the previous library, also this best-fit model overpredicts the 
flux at these blue wavelengths, and it is consistent with only 4 
out of 26 observed UVIS data points. Finally, we overplot in 
red the best-fit model found for galaxies on the red sequence, 
with the proper minimization-scaling factor to fit the off- red- 
sequence data. This model is reasonably compatible with the 
individual-galaxy UVIS data (as before, not included in fit), but 
it clearly overpredicts the flux in the red extreme of the co-added 
spectrum. 

In summary, after testing a library of models that include 
a secondary SB, we have found that they cannot reproduce 
the “rejuvenation” scenario for most of our qjf-red-sequence 
galaxies. The better fits are produced with overall young stellar 
populations (for both main component and SB), resembling a 
single young stellar component. The use of Gyr main stellar 
components systematically over/underpredicts the data flux at 
different wavelengths. We remind the reader, however, that we 
have not considered extinction in the model library used in 
this section. As we have seen before, extinction is a poorly 
constrained parameter that also introduces strong degeneracies 
with other SPPs. Therefore, the results shown in this section 
must be taken with caution and restricted to the model library 
we use here. 

5.5. Red Fraction of Massive Galaxies at z ~ 1.5 

We also investigate the fraction of quenched galaxies that have 
already settled on the red sequence (/ re d) and those that have 
not, at the average redshift of our sample (z. ~ 1.5). Using the 
red sequence computed above, we found that 28/41 galaxies fall 
within 2a of the red sequence. These galaxies constitute 68% 
of our whole sample. Since all blue SF galaxies are excluded 
from our selection, we infer that less than 68% of all galaxies at 
Z ~ 1.5 with log (M star /M 0 ) ^ 10.65 fall on the red sequence. 
We stress that this is an upper limit. While our /fhow— F/fi60w 
color pre-selection allows us to select all of the red galaxies at 
Z > 1.0, it systematically excludes blue galaxies at the same 
redshifts. 

We can improve the upper limit on / re( j by including emission- 
line-selected sources in the same apparent magnitude, redshift, 
and mass ranges of our sample. For this, we use the Dominguez 
et al. (2012) sample of emission-line galaxies from 17 WISP 
fields (all fields in common with our sample). These galaxies 
are selected only by the presence of the Ha emission line. 
Dominguez et al. limit their search to z = 1.5, thus probing 
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a smaller volume compared to our study. As a result, the value 
of / re( j provided below has still to be considered as an upper 
limit. We applied the same //n low - band magnitude, mass, and 
redshift selection criteria used in our sample galaxies, to identify 
sources within our parameter ranges. This selection results in 
15 galaxies that are not in common with our galaxy sample. 
Stellar masses from Dominguez et al. were scaled by a factor 
of 1.7 to account for the different IMFs used (Chabrier in their 
case, Salpeter in ours). By scaling the number of emission-line 
sources to our number of fields, we derive an expected number 
of 24 blue massive emission-line galaxies. This implies that, at 
Z ~ 1.5, /red < 43%. 

Recently, Peng et al. (2010) predicted the evolution of the 
red fraction as a function of galaxy mass and redshift, based 
uniquely on empirically motivated relations. Peng et al. predict 
that at z ~ 1.5, at the median mass of our galaxy sample, 
between 70% and 90% of the galaxies should have colors 
consistent with being on the red sequence. These fractions 
were derived from their predictions for galaxies in low-density 
environments. 16 

We also note that, even though the typical mass of our sample 
is well within what Peng et al. identify as the “mass-quenching” 
regime (i.e., quenching depends only on intrinsic properties of 
the galaxy and not on the local environment it inhabits), our 
color-mass relation in Figure 9 does not show such a mass 
dependency. In our data, whether a quenched galaxy resides on 
or off the red sequence appears to be independent of the galaxy’s 
mass. On the contrary, a mass-dependent quenching mechanism 
would demand a rising left-to-right gradient in the color-mass 
diagram, such as the most massive galaxies mostly populate 
the red sequence, while less massive systems should be bluer. 
This is not observed in our galaxy sample, despite covering one 
order of magnitude in mass. Therefore, it seems reasonable to 
conclude that both internal (e.g., mass quenching) and external 
(e.g., environment) mechanisms are at play in quenching the 
star formation in some of these massive galaxies. 

We speculate that the young massive galaxies with no ongoing 
star formation observed at z ~ 1.5 may be in transition between 
a phase of vigorous star formation at z > 2 and the z ~ 1.5 
red sequence. The large masses and short SF timescales that 
characterize these galaxies imply very high past SFRs (of the 
order of ~ 1 000 M 0 yr 1 ). The survey volume covered in the 
1.3 < z < 1.7 redshift range is a few xlO 8 Mpc 3 , implying 
a volume density for our quenched galaxies of ~10 -7 Mpc -3 . 
Even considering that we had to remove from our sample about 
half of the galaxies because of spectral contamination, this 
volume density is still comfortably below the observed space 
densities of some of the likely progenitors of these sources, 
i.e., ultraluminous IR galaxies at z ~ 2.5 (6 x 10 -6 Mpc -3 ; 
Chapman et al. 2005). 

Finally, we would like to highlight that the results presented in 
this paper do not change strongly because of a misclassification 
of emission-line galaxies as “quenched.” If we remove the four 
quenched galaxies with emission from the sample, we find the 
following results. First, for quenched galaxies on and off the 
red sequence the median ages (3.1 Gyr and 1.3 Gyr) and single- 
component ages from co-added spectra (3.0 Gyr and 1.3 Gyr) 
are virtually unchanged with respect to our complete sample 
estimations. Second, we find that 72% (68% previously) of the 
sample galaxies are on the red sequence. Also, the / re d < 43% 


16 The red-galaxy fraction for galaxies in high-density environments is 
predicted to be larger. 


does not change with respect to the upper limit found previously. 
As a consequence, we can be confident that neither the results 
nor the global picture presented in this work are affected by SF 
galaxies misclassified as quenched systems. 

6. CONCLUSIONS 

We have used the WISP survey to identify a sample of massive 
(log(M star /M G ) ^ 10.65) galaxies at redshift 1.0 Sj z < 2.0. 
The sample was selected to have /fhow— 7^fi60w Z 0.6 and 
//hi eow Z 23, in 27 independent fields, overcoming the effect 
of cosmic variance, which typically plagues massive galaxy 
samples at these redshifts. Our color selection implies that the 
final sample is biased against star-forming galaxies. We derived 
stellar population parameters by fitting stellar population models 
to the combined broadband photometry (//5T/WFC3-UVIS, 
Z/ST/WFC3-IR, Spitzer/IRAC) and IR spectra (0.9 fim < ), < 
1.6/im). We have shown that the availability of rest-frame 
optical spectra covering the 4000 A/Balmer breaks dramatically 
improves the reliability and accuracy of the derived stellar 
population parameters. 

Our results are based on a color- and mass-selected sample of 
41 quenched galaxies. In agreement with other studies, we find 
that at (z) = 1.5, the mass range above log (M star /M 0 ) = 10.65 
is populated by galaxies with a wide range of stellar population 
properties. We find that quenched galaxies are well fitted 
with exponentially decreasing SFFIs and short star formation 
timescales (r / 100 Myr). They also show a wide distribution 
in stellar ages, between 1 and 4 Gyr. 

We find that quenched galaxies are far from being a homoge- 
neous population. In the (u — r)o-versus-mass space, quenched 
galaxies have a large spread in rest-frame color at a given mass. 
Most quenched galaxies populate the z ~ 1.5 “red-sequence,” 
although 32% of them have substantially bluer colors. We find 
that quenched galaxies on the red sequence have older median 
ages (3.1 Gyr) than the quenched galaxies off the red sequence 
(1.5 Gyr). The average ages of the two subsamples (on and off 
the red sequence) are confirmed by the analysis of their stacked 
spectra. Furthermore, we also demonstrated that a “rejuvenated” 
SED cannot reproduce the observed stacked spectra. 

We derive the upper limit on the fraction of galaxies on the 
red sequence at z ~ 1.5 to be / re( j < 43%, in disagreement 
with empirical model predictions from Peng et al. (2010, 
/ red = 70%-90%). This mismatch can partially be due to 
the Peng et al. assumption of an instantaneous quenching 
mechanism. However, the homogeneous spread in mass of our 
quenched galaxies on and off the red sequence suggests that 
more than one mechanism is responsible for the quenching 
at these stellar masses (i.e., internal, galaxy-mass-dependent 
versus external, environmental triggers). 

We speculate that the young massive galaxies with no ongoing 
star formation are in a transition phase between vigorous star 
formation at z > 2 and the z ~ 1.5 red sequence. According 
to their estimated ages, the time required for quenched galaxies 
off the red sequence to join their counterparts on the z ~ 1 .5 
red sequence is of the order of ~ 1 Gyr. 

The question concerning what mechanisms halt star forma- 
tion remains open. Having been quenched more recently, the 
galaxies off the red sequence with no ongoing star formation 
will be the ideal laboratory to further investigate this process. 
Once the WISP survey is completed, we will be able to study 
this and other galaxy populations in greater detail, with a much 
larger number of statistics. 
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Figure 12. Example of charge transfer efficiency (CTE) correction in field Par83. Left: in the CTE correction vs. F475X magnitude (uncorrected) plot, we illustrate 
that for most of our sample galaxies the corrections are below 0.1 mag (<5% their total fluxes). Right: in the CTE correction vs. distance to the amplifiers [pixel] plot, 
we see that the correction is usually larger as the distance to the amplifier increases. 

(A color version of this figure is available in the online journal.) 
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APPENDIX A 

THE UVIS CHARGE TRANSFER 
EFFICIENCY CORRECTION 

WFC3/UVIS CCDs experience a degradation of their charge 
transfer efficiency (CTE) over time, introduced by their exposure 
to energetic radiation. The CTE degradation and fractional 
associated losses worsen for low sky background, faint fluxes, 
and distance from the readout amplifiers. These effects are 
discussed in Noeske et al. (2011) for observations obtained 
between 2009 October and 2011 October. Correction for CTE 
losses is currently not implemented in the reduction packages 
for the WFC3-UVIS data, so we corrected the measured fluxes 
as follows. 

Noeske et al. (2011) provide linear fits of the CTE losses 
as a function of background electron counts, position on the 
detector, and source flux. We interpolated the linear fits to 
compute, for each galaxy, a unique correction as a function of the 
source counts and position on the CCD. For sources fainter than 
500 e _ , the correction was obtained by extrapolating the solution 
from higher counts, while no correction was applied for sources 
brighter than 16,000 electrons. As the source detection for our 
photometric catalog was performed in the deeper (and brighter) 
•^Fiiow images, many of the WFC3/UVIS fluxes are below the 
extrapolation limit of 500 e (~45% of the catalog sources). 
Typically, these sources have median magnitudes between 25.0 
and 25.5 in the different UVIS bands used in this study. Most of 
these sources («95%), however, are also below our 3<r detection 
limit, so upper limits for their magnitudes (independent of the 
CTE correction) are used instead (see Section 2.1 for details). 


Therefore, in practice, most of the sources with 3a are above 
the extrapolation limit for the CTE correction. 

In Figure 12, we show an example of the CTE correction 
in one of our fields versus the original source magnitude (left) 
and versus y axis distance with respect to the amplifiers (right). 
Most of the galaxies in our final sample (see Section 3) have 
corrections below 0.1 mag, typically corresponding to <5% in 
their total fluxes. 

APPENDIX B 
SPECTRA EXTRACTION 

One of aXe’s features is the two-dimensional (2D) modeling 
of the source’s spatial profile for each galaxy. In what follows, 
we refer to the target galaxy as the primary and to other 
galaxies that may contaminate the spectrum as secondaries. 
In Figure 13, we show a typical example. The second panel 
of Figure 13 shows the 2D dispersed stamp extracted by aXe. 
The stamp is centered on the spectrum of the primary (circle 
in the top panel), and the spectra of two secondary objects 
are visible below it. Within aXe, source profiles for each galaxy 
are fitted at each wavelength assuming a single Gaussian profile. 
Although the single Gaussian is a sufficient approximation for 
faint, poorly resolved sources, this assumption breaks down for 
brighter resolved galaxies (such as those in the example shown 
in Figure 13). Resolved, brighter sources typically present 
extended wings, which are poorly modeled by a single Gaussian 
profile. 

After testing different functional forms to describe the ex- 
tended wings, we converged on describing each galaxy’s profile 
with a combination of two Gaussians sharing the same center, 
but with different amplitudes and width. We fit a profile of the 
form 

( —V 

G 2 (y) = Ai-e\ Sl / + Ai ■ e 
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Figure 13. Example of 2D spectral stamp for slitless spectroscopy (source Parl36 ID14 in G141 grism data). Blue symbols and labels correspond to primary target 
source; green color represents contaminants, secondary sources. Top panel: //pi60W imaging of the sources. Second panel: original 2D stamp after aXe reduction. 
Third panel: 2D stamp after removing the secondary sources with aXe. Fourth panel: 2D stamp after removing the secondary sources with our models. Fifth panel: 
residual after removing our 2D model of the primary source. At ~12200 A, the residuals from an emission line are visible. 

(A color version of this figure is available in the online journal.) 


where A i, and Ai are the amplitudes of the two Gaussian 
components, 5j, and S3 are the width, and Y c is the coordinate 
of the center in the cross-dispersion direction. This function 
provides the best compromise between functional flexibility and 
number of parameters to adjust. 

As is clear from Figure 1 3, the contamination from secondary 
sources changes with wavelength (e.g., due to relative position 
of sources in the sky, spectral features like absorption breaks 
and emission lines, and variation of the grism transmission). 
For this reason, we fit a combination of G 2 (y) functions (one 
for the primary and one for each secondary) to the spatial 
profile extracted from the 2D dispersed stamps after a three pixel 
binning in the wavelength direction. Note we are only binning 


the data to perform the fit. The spectral resolution for the rest of 
the analysis was not degraded. 

The fit is performed by minimizing the y 2 between the models 
and the observed profile at each wavelength position. The first 
guess for the parameters (A 1; .S) , A,-, .S', ) is estimated from 
the direct F110W images (for the G102 spectra) and F160W 
images (for the G141 spectra). Contrary to their appearance in 
the dispersed images, galaxies are typically well separated in the 
sky, allowing us to obtain accurate relative positions and widths 
for all components entering the fit. 

In Figure 13, we show an example of this procedure. The sec- 
ond, third, and fourth panels show an original (aXe output) 2D 
dispersed stamp, a version clean from secondary contaminants 
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Figure 14. Example of mean ID spatial profile and ID spectral extraction (source Par69 ID 1 2 in G 102 grism data). Left panel: mean spatial profile of Par69 ID 12 
spectral stamp. Blue is the best model to the primary source, green is the best model of the contaminant secondary, black is the data, and red is the best total model. 
The orange dashed line is the mean primary profile from aXe. The inner panel in the top right shows aXe’s and our primary profiles with an arbitrary normalization. 
This allows a fair comparison of the profiles as mean weight masks. Right panel: extracted ID spectrum. In black, the data are shown without cleaning. In orange and 
blue, we show aXe’s and our own extractions, respectively. 

(A color version of this figure is available in the online journal.) 




using the aXe model, and a version cleaned using our procedure, 
respectively. 

In the top panel of Figure 14, we show the mean spatial 
profile of the Figure 13 example target (black curve). We show 
our best-fit component to the primary (blue curve) and the 
secondary (green curve) profiles and also include the best-fit 
aXe estimation of the primary profile (dashed orange curve). 
In the inset panel, we show aXe’s and our profiles with 
a common (arbitrary) normalization. Because in the optimal 
spectral extraction the profile is used to weight each pixel in the 
final spectrum, the aXe profile assigns relatively more weight 
to the lower S/N wings with respect to the peak flux. This 
effect, together with a poorer aXe removal of contaminants, 
may translate into an overestimate of the total flux in the ID 
extracted spectrum and a lower S/N. 

In Figure 1 5 , we compare the integrated flux in the wavelength 
range 0. 8-1.1 /xm computed in aXe’s and our extraction. In 
the comparison, we consider only sources brighter than /pnow 
^ 23. Although there is a broad agreement between the two 
measurements, the residuals (shown in the bottom panel) show 
that aXe’s fluxes are systematically overestimated from about 
10% for bright objects up to ~50% for some faint sources. The 
flux overestimate is due to different effects for bright and faint 
sources. In fact, what dominates in the bright sources is the 
poor fit of the single Gaussian profile, while at the fainter level 
the dominant contribution is due to the poor removal of nearby 
sources. 

APPENDIX C 

CONTAMINATION FROM STARS 


(Bertin & Arnouts 1996) were used to separate galaxies from 
stars. In Figure 16, we see how stars form a well-defined trend. 
All these sources with magnitudes Tfhow ^ 23 were flagged as 
stars (similar magnitude limit as our final galaxy sample). Some 
residual contamination from stars could be expected among the 
remaining sources (galaxies) and vice versa. We pay special 
attention to this while selecting our final galaxy sample (see 
Section 3). 

A total of 234 stars were flagged in this way. Another 
71 stars (out of the demarcated area in Figure 16) were 
identified individually while checking individual galaxy spectra 
and photometry. Therefore, a total of 305 stars were flagged and 
removed from our 27 fields. 

APPENDIX D 

EFFECT OF DILUTED EMISSION LINES 
IN CONTINUUM FLUX 

We have studied the effects of diluted emission lines in the 
continuum in order to establish whether if this has an important 
effect on the SED fitting and therefore in the retrieved SPPs. In 
all the SED fits presented in this paper, regions with detected 
emission lines have been masked to include only flux from the 
continuum. Potentially problematic cases, however, come from 
undetected emission lines (e.g., low S/N, large FWHM) that 
might artificially increase the continuum flux used in the SED 
fits. We explored these cases by modeling passive SEDs with 
emission lines. With these simulations we attempted to model 
the worst possible scenarios for the detection of emission lines. 
The models were made as follows. 


The contamination from foreground stars was removed us- 
ing a magnitude-size diagram. The Tfhow measurements and 
total-light radius estimations from Source Extractor software 
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1 . We use the Bruzual & Chariot (2003) model of a typical 
quenched galaxy (r = lOMyr, M star = 10 11 M Q , SFR = 
0, solar metallicity, A v = 0 ) and include emission lines 


The Astrophysical Journal, 778:126 (24pp), 2013 December 1 


Bedregal et al. 



log(Flux 0URS G102) [erg s 1 cm 2 ] 

Figure 15. In this comparison between aXe’s and our ID spectral extractions, 
we illustrate the differences between their integrated fluxes in G102 grism 
(8000-1 1250 A). aXe systematically overestimates the fluxes between 10% and 
50% from bright to faint galaxies. Blue symbols correspond to a /fiiow ^ 23.0 
selected sample from WISP fields. Black symbols correspond to the 41 galaxies 
used in this study (see Section 5.1). Red continuous line is the 1:1 relation. Red 
dashed lines are the ±20% deviations from the 1:1 trend. 

(A color version of this figure is available in the online journal.) 

represented as Gaussians. We model the lines [O m ]5007 
and Ha as those are the most common features detected in 
our data set. In a first step, a variety of EWs (between 
35 A — our detection limit — and 500 A) and intrinsic 
FWHMs (between 40 A — the pixel size on G141 — and 
300 A) were explored for these lines. 

2. The spectra were redshifted to z = 1.3 so the [O hi] and Ha 
lines are both in G141, the poorer resolution grism. Then 
we degraded the spectra to G141 resolution. We considered 
the larger angular size of a galaxy in our sample along the 
dispersion direction of the slitless spectroscopy (525 A in 
G141) to mimic the worst-case resolution where emission 
lines can be easily missed. 

3. In these simulations we use a noise range characteristic 
for our galaxy sample. The S/N per pixel typically ranges 
between 20 and 80 with a median of 40 (corresponding to 
an S/N per A of 0.8 in G141). 

4. We produce a library of models with different combinations 
of S/N, line EW, and FWHMs. Those cases where the 
emission lines are lost in the continuum were flagged. The 
task was performed both by using the automatic line finder 
of Colbert et al. (2013) and by eye. 

5. This resulted in a set of parameter ranges between which 
the emission lines are not detected (being diluted in the 
continuum): EW: 35-50 A (above 50 A the lines are always 
identified); FWHM: 50-270 A; S/N: 30-80 (below 30 the 
lines cannot be distinguished from the noise). 
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Figure 16. Size vs. 7 fi low diagram shows that contamination from stars (sources 
within the dashed red area) can be removed as they define a distinctive sequence 
with respect to galaxies. Black symbols are all /fiiow detections in photometry. 
The dashed red lines define a star-selection area for sources brighter than 
/fiiow ^ 23. 

(A color version of this figure is available in the online journal.) 

In Figure 17, we show examples of six combinations of 
S/N, EW, and FWHM that cover the parameter space where 
the lines are not detected (six for Ha and six for [Om]). Black 
represents the pure continuum spectra, while color represents 
the spectra+emission line. 

For the six combinations of parameters we run 100 MC 
simulations with different random noise. Then we calculate 
the mean flux within 3a of the emission line in both the 
pure continuum and continuum+emission spectra. Finally, we 
estimate the percentage increase in flux in the emission-line 
cases with respect to their pure continuum spectra. In Figure 1 8, 
we show those percentages versus S /N (Ha in blue, [O hi] in 
red). Large symbols are the means, while the small “x” symbols 
are the 100 MC simulations for each case. 

As we see, the differences in flux are <7%, which is below 
the >15% flux uncertainty in the grism spectroscopy. Therefore, 
in those cases were emission lines are diluted in the continuum, 
the flux contribution from the line is lower than our la errors 
in the grism data. This implies that as long as we mask the 
detected emission lines for our SED fits, there is no need to 
make specific modeling of emission lines for our SED fitting 
process and simulations. 

APPENDIX E 

SYSTEMATIC UNCERTAINTIES IN REDSHIFT AND 
STELLAR POPULATION PARAMETERS 

In this appendix, we present a study on the systematic 
effects of SED fitting in our data. Our aim is to constrain 
for which specific data sets (e.g., with UVIS detections or 
upper limits, with or without IRAC) and SPP ranges our 
predictions are reliable, also providing systematic errors for 
the different SPPs. Our study is based on simulating our data set 
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Figure 17. Model examples of combinations of S/N, EW, and FWHM that 
cover the parameter space where emission lines are not detected. Top panels 
show Ha. Bottom panels show [O hi]. Black represents continuum spectra; color 
represents continuum+emission line. 

(A color version of this figure is available in the online journal.) 

at different redshifts with stellar population synthesis models to 
later recover the (known) redshifts and SPPs of these models 
using our customized IDL x 2 minimization code (Sections 4.1 
and 4.2). 

We simulate our photometric and spectroscopic data by 
using Bruzual & Chariot (2003) models. The complete single- 
burst model library considered to retrieve SPPs of our galaxy 
sample was used here (see Section 4.2 for the different SPP 
ranges considered). At this point we want to mention that 
assuming that the SFHs of our sample galaxies have a single- 
burst component is a simplification. Real galaxies may (and 
probably do) have more complex SFHs, like those that include 
multiple SBs during their lives. A proper exploration of these 
scenarios for the estimation of systematic errors, however, 
would substantially increase the number of models to be tested. 
Among others, multiple combinations of different numbers of 
SBs, occurring at different times, with different intensities and 
e-folding timescales would be needed. In the literature, such a 
number of simulations are not usually (if ever) attempted for the 
kind of analysis presented in this appendix, mainly because 
the immense amount of computational time is prohibitive. 
Therefore, we decided to limit our simulations to the original 
single-burst model library. As a consequence, we remind the 
reader that our systematic error estimates should be considered 
lower limits. 

The model SEDs were re-scaled to agree with typical data 
broadband fluxes and corresponding stellar masses. We multi- 
plied all the SEDs for a given factor such that a typical galaxy 



Figure 18. Percentage differences between fluxes in the model continuum and 
continuum + emission spectra are <7% for all S/N (Ha in blue, [Om] in red). 
Small “x” symbols are 100 MC simulations for each case, while large symbols 
are the mean values. 

(A color version of this figure is available in the online journal.) 

in our sample (exponentially declining SFH with r = lOMyr, 
2 Gyr old, no star formation, and no extinction) would have a 
stellar mass of 10 11 Mq. Variations in the stellar mass between 
10.6 and 11.8 dex (the range of our data) do not produce any 
significant change in the results of our simulations. Therefore, 
most of the results presented here correspond to an input model 
mass of «H0 U M 0 (a mild spread in input mass, typically of 
± 1 x 10 10 M q , occurs given the different combinations of model 
SPPs). 

Redshifts between 0.6 and 2.4 in steps of 0.2 were tested, 
and the model SEDs were redshifted accordingly. Using a 
wide redshift range for the models allowed us to explore the 
effects of redshift misidentification in the error budget and 
derived quantities. For z < 0.6, none of the models pass our 
7fiiow — Hpi6ow A 0.6 galaxy selection criterion. At z > 2.4, 
the contribution to the mass function integral is very small 
(Marchesini et al. 2009). 

We then degraded the models to G102 and G141 spectral 
resolutions to obtain our simulated grism spectroscopy. Random 
noise was also added to these models in order to match our data. 
A per pixel of ~40 was adequate to represent most of our galaxy 
spectra. Then the model spectra were binned in wavelength to 
reproduce the slitless-resolution dependency on the angular size 
of the source in the dispersion direction. For the angular size, 
we use the median luminosity profile FWHM of our sample 
galaxies in Jpnow band, corresponding to five pixels in the 
WFC3 detector (125 and 237 A in G 102 and G 141 , respectively). 

In addition to simulating the grism spectra, we use the model 
SEDs to simulate broadband photometry (F475X, F600LP, 
FI 10W, and F160W filters). For the two WFC3/UVIS bands 
we considered a typical data lcr uncertainty of 0.15 mag. Their 
fluxes were randomized using a Gaussian distribution with 
this width. Then we compared the model UVIS magnitudes 
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Figure 19. Left: peak-normalized histogram of weights for retrieved redshifts and SPPs as a function of zin (black). Red represents the smoother function used for our 
calculations. The weights are from the redshift distribution of CANDELS (Grogin et al. 2001) after selecting galaxies with the same color-magnitude criteria used 
in our data. Left: systematic error in zout vs. zout- Weighted mean systematic offsets for nine bins (0.1 wide in zout) are shown as black dots. Error bars represent 
the 68%. In the bottom we show the color code for zin and in parentheses the number of models that pass our color-magnitude selection criteria. The positions of 
individual data points have been slightly randomized for presentation purposes. 

(A color version of this figure is available in the online journal.) 


with typical 3a upper limits from our galaxy master sample 
(25.4 and 24.9 mag in F475X and F600LP, respectively). If 
a model UVIS magnitude was brighter than the limit, it was 
considered as a detection with its corresponding uncertainty. 
If equal to or below the limit, the UVIS magnitude was 
assigned the 3a upper limit for the band and considered as such 
for the x 2 minimization process. We measured and randomized 
the IRAC 3.6 /tm photometry in an analogous way with a 
characteristic error of 0.3 mag. 

Once we had our SED library redshifted and simulating real 
data conditions, all the models passed through the same selection 
criteria in color (/fhow - 7/pi6ow ^ 0.6) and magnitude (//fi60W 
^ 23.0) as our galaxies. Then we used our customized IDL code 
to recover redshifts and SPPs of those models that passed the 
selection. In what follows, the original redshifts and SPPs from 
which the model SEDs were made are referred to as “input 
parameters” (or sub-index “IN,” e.g., zin, Agei N ), while the 
redshifts and SPPs retrieved by our IDL code are referred to 
as “recovered parameters” (or sub-index “OUT,” e.g., zout, 
Ageour)- 

In order to make a reliable representation of our data set, 
we need more than retrieving the galaxy SPPs and modeling 
the specific features of the photometric and spectroscopic 
data. When observations cover a given redshift range, different 
selection effects are present, like the different cosmic volumes 
and regions of the luminosity (mass) function sampled at 
different redshifts. We took into account these effects in our 
systematic error calculations by assigning a “relative weight” 
to each simulated SED as a function of redshift (z IN ). The 
selection effect was introduced as a normalized multiplicative 
factor for a retrieved redshift or SPP (e.g., zout, Ageour)- The 
weights come from the redshift distribution of galaxies from 
CANDELS (Grogin et al. 2001) after selecting sources with the 
same color-magnitude criteria used in our data. A normalized 
histogram of the redshift distribution is shown in the left panel 
of Figure 19, together with a smoothed version used in our 
calculations. 

In the right panel of Figure 19, we show the systematic errors 
in Zout versus zout with median systematic offsets for nine 


redshift bins (black dots) and their 68% percentiles. In this plot, 
we clearly see that for zin ^ 1.0 the redshifts are completely 
misrecovered, having in most cases values zout ^ 1-0. The 
fact that we were not sampling the 4000 A break with the 
spectroscopy at low z has a relevant systematic effect for 
Zout ^ 1-2. As a result of these redshift-recovering tests, we 
decided to remove from our galaxy sample all sources with 
recovered spectrophotometric redshifts <1.0 as they are not 
reliable. For redshifts ^1.0, the systematics are consistent with 
zero, within the uncertainties. We present those uncertainties in 
Table 2. 

As retrieving spectrophotometric redshifts is the first step 
in characterizing our galaxy sample, any uncertainty in the 
systematic error of this parameter will have repercussions in 
the recovered SPPs. For this reason, the systematics for most of 
the SPPs shown in Table 2 are also presented at different redshift 
ranges. 

In Figure 20, we show comparisons between input and 
output age, SFH, extinction (Ay), stellar mass, and SFR for 
0.6 ^ zin ^ 2.4 models. The different colored in symbols and 
histograms represent different zin as described in the legend. 
The continuous red lines indicate no difference between input 
and output values. 

In panel (A) of Figure 20, we show the percentage difference 
between input and output ages (Aage) versus Age OUT . The 
median systematic offsets (black dots) clearly show large 
uncertainties for ages ~1 Gyr. We notice, however, that the 
larger contribution to these offsets comes from low zin where the 
4000 A break is not sampled with the spectroscopy. On the other 
hand, we found an important degeneracy between age and Ay. 
This degeneracy also contributes to the systematics in Age 0UT - 
Aage described before, and its strength strongly depends on 
how well can we constraint these SPPs at different redshifts. 
The Aage versus Age 0UT systematics become more evident at 
lower redshifts where we lack the spectroscopic constraint of the 
4000 A break. But even at larger redshifts, we also observe some 
degree of degeneracy between young underestimated ages and 
high overestimated extinctions. Taking all this in consideration, 
we decided to proceed as follows. We only considered ages at 
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Figure 20. Comparison between input and output SPPs for 0.6 ^ sj 2.4 models and stellar mass ~ 1 0 1 Mq. In panel (A), we show the percentage difference 
between input and output ages vs. Age OUT . Black dots with error bars represent weighted mean (%) systematic offsets and 68% percentiles in bins of 1 Gyr in Age 0 uT- 
The red dots and error bars only consider models with zout ^ 1.3. In panel (B), we show SFHout vs. SFHin- In (C) we present the difference between input and 
output extinctions vs. Avout- Dots with error bars represent weighted mean systematic offsets and 68% percentiles in bins of 0.2 mag. In panel (D) we show histograms 
for recovered stellar mass (~ 1 0 1 1 Mq) at different redshifts. Vertical dashed lines represent the median of each distribution. The black dot with error bars represents 
the weighted mean (%) systematic offset in stellar mass. In (E) we present the difference between input and output SFRs vs. SFRout- Dots with error bars represent 
weighted mean systematic offsets and 68% percentiles in different bins. The last dot with an arrow represents the systematic offset for SFRout > 20 Mq yr -1 . The 
different colored “x” symbols and histograms represent different zi\ as described in the legend (number of models per zin in parentheses). 

(A color version of this figure is available in the online journal.) 


Zout ^ 1.3 as constrained enough in order to be discussed in this 
paper. The extinctions and lower-redshift age estimations are too 
degenerate to make meaningful statements based on them. As 
for the redshifts, the systematics in Age 0UT are consistent with 
zero within the uncertainties. We show these results in Table 2. 

In panel (D) of Figure 20, we show histograms for the 
difference (in %) between recovered and input stellar masses. 
In this particular case, we show models with M star 10 11 M 0 , 
as this is a representative mass for our galaxy sample and there 
are no strong variations in the results within our galaxy mass 
range. The stellar mass is the most robust SPP retrieved with 
our data. Between 1.2 ^ Zout ^ 1.8 there are virtually no 
systematic offsets within log (Mstar, out/^o) = 10.6-11.8 
(^10%). At Zout < 1.2, systematic offsets become more 
relevant, varying with stellar mass from 30%, 20%, and 10% 
at log (M star /M 0 ) = 11.8, 11.0, and 10.6, respectively (on 
average). These systematics are shown in Table 2. 

For the remaining two SPPs (SFR and SFH) different com- 
binations of data sets could determine whether the recovered 
values are reliable (or not). In panel (B) of Figure 20, we show 
output versus input SFHs. From this figure we highlight two 
points. First, for our entire redshift range we can distinguish 


between short (r 100 Myr) and long (r ^ 1 Gyr) SFHs, 
although we cannot separate SFHs with r = 100 Myr and 
r = 10 Myr. Second, short SFHs (r ^ 100 Myr) are recovered 
independently of the data set used to derive redshifts and SPPs. 
For more extended SFHs (r ^ 1 Gyr), however, we found that 
we are able to recover similarly extended SFHs only for data 
sets with two UVIS detections independently of the redshift. 
Overall, these simulations imply we were able to broadly dis- 
criminate between r above and below 1 Gyr. 

In panel (E) of Figure 20, we present the difference between 
output and input SFRs versus SFRout- Over the entire redshift 
range we can successfully recover SFRout ^ 10M Q yr _1 
with uncertainties ^3 Mq yr 1 . For SFRout > lOM 0 yr _1 , 
however, the uncertainties in the systematics increase steadily, 
reaching 2OM 0 yr _1 at SFRout > 2OM 0 yr _1 . The different 
retrieved redshifts do not make much of a difference on this 
trend, but only increase the scatter on the median systematic 
errors at high zout- For this reason, in Table 2, we present the 
systematics in SFR independently of redshift. 

The strong systematic overprediction of the SFR for 
SFRout > 20 M q yr 1 , however, does not have a signifi- 
cant impact in our paper. First, we base the science on a 
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Figure 21. Same as Figure 20 but for SED fits without IRAC 3.6 fim data. Analogously to Figure 19 (right panel), in panel (F), we show the weighted mean systematic 
uncertainties in zout for SED fits without IRAC data. The zin color code is the same as in Figures 19 and 20. 

(A color version of this figure is available in the online journal.) 


quenched galaxy sample that always has the low SFRs we can 
successfully retrieve. Second, we do not make claims based on 
the precise values of the SFRs. We only use it through the spe- 
cific SFR to identify quenched galaxies (SSFR < 10~ 2 Gyr -1 ). 
As no further claims are made based on SSFRs, the relevant test 
is to determine whether an intrinsically quenched/SF galaxy 
can be recovered as such with our procedure, independently of 
the exact value of their SSFR. 

Stellar population properties such as SSFR and SFH are 
closely linked. For example, at the time of observation a galaxy 
with an extended SFH is more likely to show a higher SSFR 
than a short-burst galaxy. Also, our ability to constrain these 
properties is tightly related to the constraints we can provide 
with the data. Of particular importance is the number of UV1S 
detections available to constrain these SPPs. Detections in the 
rest-frame UV are more likely to occur in SF galaxies. For these 
reasons we decided to explore the success in recovering SSFRs 
and SFHs for different UVIS data sets in our entire redshift 
range. In this exercise we did not explore individual values of 
SSFR and SFH but ranges within which these parameters are 
reliable. For the SSFR two regimes were defined as above/below 
1 0 -2 Gyr 1 (our quenched/SF selection criterion). As already 
mentioned in previous paragraphs, SFHs are defined as short 
SFHs if r < 1 Gyr and extended if r ^ 1 Gyr. 

Our results show that the reliability of the recovered SFHs 
and SSFRs depends on the constraints from the UVIS data. 
At all redshifts we reliably recovered the quenched and 


short-SFH models (the relevant sources for this paper). This 
was independent of the level of constraint from UVIS data (two 
upper limits, one and two detections). On the contrary, and just 
to give the complete picture to the reader, SF and extended-SFH 
models were harder to constrain. Only those models with two 
UVIS detections provided reliably recovered SFHs and SSFRs. 

In Figure 21, we present the systematic uncertainties of all our 
SPPs and redshifts for SED fits on which no IRAC 3.6 /xm data 
were used (referred as “no-IRAC” from now on). A summary of 
the systematic errors for no-IRAC models is shown in the right 
column of Table 2. We observe that ~15 % of the model SEDs 
experience convergence issues during the / 2 minimization. This 
percentage is fairly constant for our entire range of zin models. 
In a closer analysis of these cases we found that models with 
Av,in > 0.8 mag are frequent among the nonconverging cases, 
particularly at zin < 1.0. This does not imply, however, that 
a majority of high- Ay in models do not converge. During the 
selection of our galaxy sample, the cases of nonconvergence 
for no-IRAC galaxies were very few. In any case, given the 
number of no-IRAC galaxies in our final sample, 15% of failed 
convergence would imply that at the most we are losing one 
galaxy for this reason. Overall, we conclude from this analysis 
that the IRAC 3.6 /xm data make it easier to constrain sources 
with high extinction. 

As a test, to avoid misinterpreting the slightly smaller number 
statistics of no-IRAC simulations with respect to the IRAC 
counterpart, we remove from the latter the non-convergent 
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models of the former. We found that the systematic errors and 
dispersions of the simulations with IRAC barely change after 
removing those models. Therefore, we can confidently interpret 
differences among IRAC/no-IRAC models as the consequence 
of lacking the 3.6 /rm data. 

For all the SPPs no major changes were observed in the 
median systematic errors by using or not IRAC data. For zout, 
no changes in the systematics were observed with respect to 
the simulations with IRAC. The systematics in stellar ages 
remain fairly stable, with a 10% increment in the systematic 
uncertainties for zout > 1.8. Only a reduction from 75% to 65% 
appears for the younger ages at 1.3 Zout ^ 1.8. However, as 
the uncertainties for that age regime are very large, we decided 
to keep 75% of error in the systematics for the no-IRAC models 
for those age and redshift ranges. 

The SFHs are still well recovered within the broad categories 
defined above (exponentially declined with r ^ lOOMyr 
and ^1 Gyr). The no-IRAC model runs only suggest a larger 
fraction of misidentified Tin = 5 Gyr as tout = 1 Gyr. Also, 
without IRAC, fewer models are misclassified as exponentially 
increasing SFHs. 

The lack of IRAC data has limited effect on the median 
systematics of stellar mass. It mostly widens the distributions 
shown on panel (D) of Figure 21, where the stronger effects 
are observed at high zin. This implies that the stellar mass 
of a galaxy is probably better constrained by using IRAC, 
particularly at high retrieved redshift. However, this does not 
translate into a larger average offset in the systematics on no- 
IRAC models: only at zout ^ 1.8 is a mild increment of 5% in 
the systematic error measured. 

The SFRout suffers from minor variations in the uncertainty 
on the systematics for no-IRAC models. As might be expected 
at this point, the lack of IRAC data does not prevent us from 
constraining the SSFR as we did for the models with IRAC 
photometry. 

In summary, our study of the systematic uncertainties in the 
SED fitting procedure allowed us to identify those specific data 
sets (combinations of photometry and spectroscopy) that did 
not provide enough constraints to reliably recover certain SPPs. 
Apart from Ay and ages from sources at z < 1.3, quenched- 
galaxy SPPs are fairly well constrained independently of the 
data set used. Concerning the actual systematic uncertainties we 
retrieved, there is no significant bias in the derived stellar masses, 
(S)SFR, SFH, and ages (the latter at z > 1.3) for galaxies with 
and without IRAC photometry. Our ability to constrain SPPs 
with our current data set is in part constrained by our uncertainty 
in redshift determination, for which the 4000 A break is the 
key ingredient. A summary of the systematic errors for each 
discussed SPP is presented in Table 2. 

Some final remarks: We should mention that systematic 
uncertainties would increase/decrease in the case of an IMF 
variation within our galaxy sample. The variation in the IMF 
is a very debated topic where most of the recent developments 
are restricted to local galaxy data. A recent paper by Cappellari 
et al. (2012) on local early -type galaxies suggests the existence 
of a range of IMFs from “heavier Salpeter” to Chabrier cases. 
In this work, however, and for the specific mass range of our 
galaxy sample (assuming that log (M star /M 0 ) ^ 10.65 galaxies 
have velocity dispersions ^200 km s -1 ), there is no correlation 
between IMF and stellar mass. Galaxies in this mass range 
spread from heavier Salpeter to Chabrier-like IMFs (see also 
Spiniello et al. 2013). Mitchell et al. (2013) present a theoretical 
study on the systematic effects of SED fitting in stellar mass 


estimations. For different stellar population synthesis models 
(a Bruzual & Chariot version and Maraston 2005) and IMFs 
(Kennicutt 1983 and a non-universal IMF), these authors find 
no IMF dependency on mass estimations in our mass range. For 
high masses, the scatter in the systematics is large independently 
of model/IMF used, with dispersions ranging from 20% to 
300% the value of the original input mass. All these results may 
imply that IMF variations are a galaxy-to-galaxy phenomenon, 
information we certainly lack. Therefore, we chose to use a 
unique Salpeter IMF for our calculations as it seems to be 
“representative” of the galaxy population we are studying. It 
also allowed us to directly compare our results with similar 
stellar population studies in the literature. 
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